29 min read
Intelligence Cartography The environment is no longer a passive test harness. It is a data engine. 10 dimensions of scaling, from task generation to multi-agent self-play.
The environment is no longer a passive test harness. It is a data engine. 10 dimensions of scaling, from task generation to multi-agent self-play.
13 interactive puzzle games with 1,872 levels inspired by The Witness, compatible with ARC-AGI-3 SDK and RL-ready via OpenEnv — an open-source training ground for teaching machines fluid intelligence
Re-examining mid-training as the strategic centerpiece of the LLM pipeline — how it builds the knowledge foundation for agentic RL, and how RL signals are now flowing backward to improve mid-training itself
A Step-by-Step Walkthrough using Slime and SWE-Bench as an Example
Low-Resource RLVR for Transforming Instruct Models into Reasoning Models